Chip-level and multi-node analysis of energy-optimized lattice-Boltzmann CFD simulations

Abstract

Memory-bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice-Boltzmann method (LBM) on an Intel Sandy Bridge cluster as a prototype scenario to investigate if and how single-chip performance and power characteristics can be generalized to the highly parallel case. First we perform an analysis of a sparse-lattice LBM implementation for complex geometries. Using a single-core performance model, we predict the intra-chip saturation characteristics and the optimal operating point in terms of energy to solution as a function of implementation details, clock frequency, vectorization, and number of active cores per chip. We show that high single-core performance and a correct choice of the number of active cores per chip are the essential optimizations for lowest energy to solution at minimal performance degradation. Then we extrapolate to the MPI-parallel level and quantify the energy-saving potential of various optimizations and execution modes, where we find these guidelines to be even more important, especially when communication overhead is non-negligible. In our setup we could achieve energy savings of 35% in this case, compared to a naive approach. We also demonstrate that a simple non-reflective reduction of the clock speed leaves most of the energy saving potential unused.
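The search for an optimal operating point described in the abstract can be illustrated with a toy model. The sketch below is not the authors' model: it assumes performance that scales linearly with core count and clock frequency until it hits a memory-bandwidth ceiling, and a chip power made of a baseline term plus a per-core term that is quadratic in frequency. Every constant and function name is a hypothetical placeholder, so the numbers it produces say nothing about the paper's measurements.

```python
from itertools import product

# All values below are hypothetical placeholders, not measured data.
P_CORE_PER_GHZ = 10.0   # assumed single-core performance per GHz (e.g. MLUP/s)
P_SAT = 60.0            # assumed chip-level ceiling set by memory bandwidth
W0 = 25.0               # assumed baseline chip power in W, independent of cores
W1 = 1.5                # assumed per-core power coefficient, W per GHz
W2 = 1.0                # assumed per-core power coefficient, W per GHz^2
CORES = range(1, 9)     # assumed 8-core chip
FREQS_GHZ = (1.2, 1.5, 1.8, 2.1, 2.4, 2.7)  # assumed available clock settings


def performance(n_cores, f_ghz):
    """Linear scaling in cores and clock, capped at the bandwidth ceiling."""
    return min(n_cores * P_CORE_PER_GHZ * f_ghz, P_SAT)


def chip_power(n_cores, f_ghz):
    """Baseline power plus a per-core part that grows with frequency."""
    return W0 + n_cores * (W1 * f_ghz + W2 * f_ghz ** 2)


def energy_to_solution(work, n_cores, f_ghz):
    """Energy in J: power multiplied by the runtime for a fixed amount of work."""
    runtime = work / performance(n_cores, f_ghz)
    return chip_power(n_cores, f_ghz) * runtime


def best_operating_point(work):
    """Exhaustive search over (cores, frequency) for minimal energy."""
    return min(product(CORES, FREQS_GHZ),
               key=lambda nf: energy_to_solution(work, *nf))


if __name__ == "__main__":
    n, f = best_operating_point(1000.0)
    print(f"cores={n}, f={f} GHz, E={energy_to_solution(1000.0, n, f):.1f} J")
```

In a model of this shape, cores added beyond the saturation point raise power without raising performance, which is why the number of active cores appears as a tuning parameter alongside the clock frequency.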
